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PNEUMOCOCCAL PROTEIN HOMOLOGS 
AND FRAGMENTS FOR VACCINES 



This application claims the priority of U.S. Provisional Application 
60/150,750, filed August 25, 1999, the disclosure of which is hereby 
incorporated by reference in its entirety. 

FIELD OF THE INVENTION 

This invention relates generally to the field of bacterial antigens and their 
use, for example, as immunogenic agents in humans and animals to stimulate 
an immune response. More specifically, it relates to the vaccination of 
mammalian species, especially humans, with one or more polypeptides derived 
from gram positive bacteria and which show sequence homology with an 
immunogenic polypeptide obtained from Streptococcus pneumoniae. 

BACKGROUND OF THE INVENTION 

Polypeptides derived from gram positive bacteria are useful for 
stimulating production of antibodies that protect the vaccine recipient against 
infection by a wide range of serotypes of pathogenic gram positive bacteria, 
including S, pneumoniae. Further, the invention relates to antibodies against 
such polypeptides useful in diagnosis and passive immune therapy with respect 
to diagnosing and treating such pneumococcal infections. 

The genus Streptococcus contains a variety of species responsible for 
causing disease in mammals, including humans, while also encompassing 



species that constitute normal flora in humans and other mammals. Among the 
bacterial species implicated in the etiology of diseases in humans are S, 
pyogenes (part of the group A streptococcal bacteria, herein designated "GAS" 
for "group A streptococci"), S, pneumoniae (referred to as "pneumococcus") 
5 and S, agalactiae (the group B streptococci or "GBS"). The group A 
streptococci cause serious diseases such as necrotizing fasciitis, scarlet fever 
and sepsis, as well as less virulent diseases such as impetigo and pharyngitis. 
The pneumococci are the most common cause of community-acquired 
pneumonia and are also responsible for more than half of all cases of otitis 
10 media in children. The pneumococci are also the second most common 
pathogen associated with bacterial meningitis. The group B streptococci are 
the most prevalent pathogen associated with illness and death among 
newborns in the United States. 

15 Currently, there are no vaccines available for the prevention of diseases 

caused by the group A and group B streptococci and presently available 
pneumococcal vaccines are not effective in children under 2 years of age or in 
the elderly due to the poor immunogenicity of the capsular carbohydrates that 
compose the current vaccine. It would therefore be highly advantageous to 

20 produce a vaccine that would prevent infection by these classes of pathogen, 
especially in the age groups mentioned. 

In addition to the pathogens just described, some bacteria of the genus 
Staphylococcus are also of clinical importance. In fact, two of these are among 

25 the leading causes of nosocomial infections (infections acquired while in the 
hospital). Both Staphylococcus aureus and Staphylococcus epidermidis readily 
colonize the skin of healthy individuals and can cause acute disease in patients 
following immunosuppression or traumatic injury. Infections caused by these 
species include bacteremia, endocarditis, osteomyelitis, wound infections and 

30 infections associated with indwelling catheters. 



2 



Streptococcus pneumoniae is a gram positive bacterium that is a major 
causative agent in invasive infections in animals and humans, such as the 
aforementioned sepsis, meningitis, and otitis media, as well as lobar 
5 pneumonia (Tuomanen, et al. New England J. of Medicine 322:1280-1284 
(1995)). As part of the infection process, pneumococci readily bind to non- 
inflamed human epithelial cells of the upper and lower respiratory tract by 
binding to eukaryotic carbohydrates in a lectin-like manner (Cundell et al., 
Micro. Path. 17:361-374 (1994)). Conversion to invasive pneumococcal 
10 infections for bound bacteria may involve the local generation of inflammatory 
factors which may activate the epithelial cells to change the number and type 
3 of receptors on their surface (Cundell, ef al., Nature, 377:435-438 (1995)). 

1 Apparently, one such receptor, platelet activating factor (PAF) is engaged by 
W the pneumococcal bacteria and within a very short period of time (minutes) 

2 15 from the appearance of PAF, pneumococci exhibit strongly enhanced 

adherence and invasion of tissue. Certain soluble receptor analogs have been 
S shown to prevent the progression of pneumococcal infections (Idanpaan- 

B Helkkila et al., J. Inf. Dis., 176:704-712 (1997)). A number of other proteins 

S have been suggested as being involved in the pathogenicity of S. pneumoniae. 

3 20 

Streptococcus pneumoniae itself has been shown to contain a gene 
which encodes a protein designated herein as Sp36. This protein has a 
predicted molecular mass of 91,538 Da and contains 5 histidine triad motifs 
(proposed to be involved in metal binding). The gene encoding this protein 
25 appears to be present the 23 serotypes comprising the current commercially 
available pneumococcal-capsular vaccine. Immunization of mice with this 
protein, in the presence of Freund's adjuvant, stimulates an immune response 
which protects these mice from an intraperitoneal challenge with a dose of 
virulent pneumococci that would normally kill the mice. 

30 
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For the reasons already stated above, there not only remains a need for 
identifying polypeptides having epitopes in common from various strains of S. 
pneumoniae but also from a broader spectrum of gram positive bacteria in 
order to utilize such polypeptides as vaccines to provide protection against a 
wide variety of infectious organisms. 

BRIEF SUMMARY OF THE INVENTION 

In accordance with the present invention, there is provided vaccines that 
include polypeptides obtained from gram positive bacteria other than S. 
pneumoniae, as well as variants of said polypeptides and active fragments of 
such polypeptides. 

The present invention is also directed to novel genes, and the 
polypeptides encoded thereby, derived from gram positive bacteria other than 
pneumoniae, and which bear sequence homology to the Sp36 gene already 
described. Such gram positive bacteria include the group A and B streptococci, 
as described herein, as well as species of the genus Stapiiy/ococcus, especially 
S. aureus. 

In a particular embodiment, the present invention is directed to specific 
gene sequences, and proteins encoded thereby, derived from the group A and 
group B streptococci, and to the use of such expressed polypeptides and 
proteins as the basis for pharmaceutical compositions useful as vaccines and 
as a means for enabling isolation of antibodies with therapeutic and/or 
prophylactic activity (such as would be useful in preparing products like 
CytoGam). 

In a further embodiment, the present invention also relates to the 
preparation and use of fragments of the novel polypeptides disclosed herein. 



such fragments being immunogenic in nature and being useful in the 
preparation of vaccines against diseases caused by the pathogens from which 
such polypeptides, and fragments thereof, are derived. 

5 



BRIEF DESCRIPTION OF DRAWINGS 

10 

Figures 1 shows the results of a Southern blot of genomic DNA from 
S, aureuSr S, pyogeneSr and pneumococcus. The DNA was digested with 
restriction nucleases BdmH\ or PvuW, and after electrophoresis and transfer 
to a nylon membrane, was probed with a labeled DNA fragment 
15 encompassing the pneumococcal gene encoding Sp36. The hybridization and 
washes were carried out under low stringency conditions. The results show 
hybridization by the labeled probe to a S. aureus fragment in both the BamHl 
and Pvull lanes and to two fragments in the PvuW digests of two strains of S. 
pyogenes, 

20 

Figures 2 shows an alignment between the Sp36 amino acid sequence 
from S. pneumoniae strain N4 and the homologous sequences from S, 
pyogenes and S, agalactiae. Amino acids identical to those of the 
polypeptide from S. pneumoniae are boxed. 

25 

Figure 3 shows the results of a Southern blot of genomic DNA from S. 
pyogenes, S, agalactiaer and 5, pneumoniae probed with DNA encoding the 
full length Sp36 homolog from S. pyogenes. The hybridization was carried 
out under low stringency conditions. These results demonstrate that the S. 
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pyogenes Sp36 homolog, used as a probe, is capable of detecting a 
homologous gene in S. agalactiae and pneumococcus. 

Figure 4 shows the results of a western blot using rabbit polyclonal 
antiserum generated against recombinant Sp36 protein cloned from S. 
pneumoniae strain Norway 4. The results demonstrate that this antiserum 
not only reacts with the protein against which it was raised {here, Sp36), as 
well as to a protein of similar size in a lysate of a serotype 6B strain of 
pneumococcus, but also reacts with a recombinant protein encoded by the 
Sp36 homolog gene of group B streptococci. 

Figure 5 shows the amino acid sequence for the GAS36 homologs 
with the histidine triad regions underlined (Fig. 5(a) and (b)) and the 
sequence for a GBS36 homolog (Fig. 5(c)) with its histidine triad regions 
underlined. 



DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to novel polynucleotides and 
polypeptides derived from species of gram positive bacteria, especially group A 
and 8 streptococci, and including the genus Staphylococcus, most especially 
pyogenes (GAS), S. agalactiae (GBS), and S. aureus, respectively. 

Further, the present invention is directed to polynucleotides derived 
from gram positive bacteria and which are at least partially homologous to the 
polynucleotides making up the gene coding for the previously disclosed Sp36 
gene of S. pneumoniae (U.S. Application Serial No. 60/1 13,048). 



The present invention is also directed to polynucleotides, and 
immunologically active fragments, segments, or portions, thereof, which 
polypeptides are encoded by the polynucleotides disclosed herein. 

5 The present invention also relates to such polynucleotides and 

polypeptides in enriched, preferably isolated, or even purified, form. 

In accordance with the present invention, the term "DNA segment" 
refers to a DNA polymer, in the form of a separate fragment or as a 

10 component of a larger DNA construct, which has been derived from DNA 
isolated at least once in substantially pure form, i.e., free of contaminating 
endogenous materials and in a quantity or concentration enabling 
identification, manipulation, and recovery of the segment and its component 
nucleotide sequences by standard biochemical methods, for example, using a 

15 cloning vector. Such segments are provided in the form of an open reading 
frame uninterrupted by internal nontranslated sequences, or introns, which 
are typically present in eukaryotic genes. Sequences of non-translated DNA 
may be present downstream from the open reading frame, where they do 
not interfere with manipulation or expression of the coding regions. 

20 

The nucleic acids and polypeptide expression products disclosed 
according to the present invention, as well as expression vectors containing 
such nucleic acids and/or such polypeptides, may be in "enriched form." As 
used herein, the term "enriched" means that the concentration of the 

25 material is at least about 2, 5, 10, 100, or 1000 times its natural 
concentration (for example), advantageously 0.01 %, by weight, preferably at 
least about 0.1% by weight. Enriched preparations of about 0.5%, 1 %, 
5%, 10%, and 20% by weight are also contemplated. The sequences, 
constructs, vectors, clones, and other materials comprising the present 

30 invention can advantageously be in enriched or isolated form. 
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"Isolated" in the context of the present invention with respect to 
polypeptides (or polynucleotides) means that the material is removed from its 
original environment (e.g.r the natural environment if it is naturally occurring). 
5 For example, a naturally-occurring polynucleotide or polypeptide present in a 
living organism is not isolated, but the same polynucleotide or polypeptide, 
separated from some or all of the co-existing materials in the natural system, is 
isolated. Such polynucleotides could be part of a vector and/or such 
polynucleotides or polypeptides could be part of a composition, and still be 
10 isolated in that such vector or composition is not part of its natural 
environment. The polypeptides and polynucleotides of the present invention 
S are preferably provided in an isolated form, and most preferably are purified to 

£ homogeneity. 

m 

lar ;: 

H 1^ 

15 The polynucleotides, and recombinant or immunogenic polypeptides, 

y I 

disclosed in accordance with the present invention may also be in "purified" 

m form. The term "purified" does not require absolute purity; rather, it is 

!^ intended as a relative definition, and can include preparations that are highly 

O purified or preparations that are only partially purified, as those terms are 

* »' 

20 understood by those of skill in the relevant art. For example, individual 
clones isolated from a cDNA library have been conventionally purified to 
electrophoretic homogeneity. Purification of starting material or natural 
material to at least one order of magnitude, preferably two or three orders, 
and more preferably four or five orders of magnitude is expressly 

25 contemplated. Furthermore, claimed polypeptides having a purity of 
preferably 0.001 %, or at least 0.01 % or 0.1 %; and even 1 % by weight or 
greater is expressly contemplated. 

The term "coding region" refers to that portion of a gene which either 
30 naturally or normally codes for the expression product of that gene in its 
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natural genomic environment, i.e., the region coding in vivo for the native 
expression product of the gene. The coding region can be from a normal, 
mutated or altered gene, or can even be from a DNA sequence, or gene, 
wholly synthesized in the laboratory using methods well known to those of 
skill in the art of DNA synthesis. 

In accordance with the present invention, the term "nucleotide 
sequence" refers to a heteropolymer of deoxyribonucleotides. Generally, 
DNA segments encoding the proteins provided by this invention are 
assembled from cDNA fragments and short oligonucleotide linkers, or from a 
series of oligonucleotides, to provide a synthetic gene which is capable of 
being expressed in a recombinant transcriptional unit comprising regulatory 
elements derived from a microbial or viral operon. 

The term "expression product" means that polypeptide or protein that is 
the natural translation product of the gene and any nucleic acid sequence 
coding equivalents resulting from genetic code degeneracy and thus coding 
for the same amino acid(s). 

The term "fragment," when referring to a coding sequence, means a 
portion of DNA comprising less than the complete coding region whose 
expression product retains essentially the same biological function or activity 
as the expression product of the complete coding region. 

The term "primer" means a short nucleic acid sequence that is paired 
with one strand of DNA and provides a free 3'OH end at which a DNA 
polymerase starts synthesis of a deoxyribonucleotide chain. 

The term "promoter" means a region of DNA involved in binding of RNA 
polymerase to initiate transcription. 



The term "open reading frame (ORF)" means a series of triplets coding 
for amino acids without any termination codons and is a sequence 
(potentially) translatable into protein. 

As used herein, reference to a DNA sequence includes both single 
stranded and double stranded DNA. Thus, the specific sequence, unless the 
context indicates otherwise, refers to the single strand DNA of such 
sequence, the duplex of such sequence with its complement (double 
stranded DNA) and the complement of such sequence. 

In accordance with the present invention, the term "percent identity" or 
"percent identical," when referring to a sequence, means that a sequence is 
compared to a claimed or described sequence after alignment of the sequence 
to be compared (the "Compared Sequence") with the described or claimed 
sequence (the "Reference Sequence"). The Percent identity is then determined 
according to the following formula: 

Percent Identity = 100 [1-(C/R)] 

wherein C is the number of differences between the Reference Sequence and 
the Compared Sequence over the length of alignment between the Reference 
Sequence and the Compared Sequence wherein (i) each base or amino acid in 
the Reference Sequence that does not have a corresponding aligned base or 
amino acid in the Compared Sequence and (ii) each gap in the Reference 
Sequence and (iii) each aligned base or amino acid in the Reference Sequence 
that is different from an aligned base or amino acid in the Compared Sequence, 
constitutes a difference; and R is the number of bases or amino acids in the 
Reference Sequence over the length of the alignment with the Compared 
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Sequence with any gap created in the Reference Sequence also being counted 
as a base or amino acid. 

If an alignment exists between the Compared Sequence and the 
5 Reference Sequence for which the percent identity as calculated above is 
about equal to or greater than a specified minimum Percent Identity then the 
Compared Sequence has the specified minimum percent identity to the 
Reference Sequence even though alignments may exist in which the 
hereinabove calculated Percent Identity is less than the specified Percent 
10 Identity. 

Thus, the present invention is directed to novel, isolated polypeptides 
comprising an amino acid sequence at least 75% identical to a sequence in 
SEQ ID NO: 2, 4 or 6, preferably polypeptides at least 90% identical thereto, 
15 more preferably 95% identical to the sequence of SEQ ID NO: 2 or 4, and 
most preferably having the sequence of either SEQ ID NO: 2 or 4. 

The isolated polypeptides of the present invention may be found in a 
wide variety of microorganisms, but will commonly be found in an organism 

20 selected from the group consisting of group A streptococci, group B 
streptococci, and Staphylococcus aureus, and wherein the group A 
streptococcal organism is Streptococcus pyogenes and the group B 
streptococcal organism is Streptococcus agalactiae. Also, polypeptides of the 
invention include, but are in no way limited to, isolated polypeptides having a 

25 sequence at least 25% identical to the amino acid sequence of the Sp36 
protein of Streptococcus pneumoniae. 

The present invention further relates to immunogenically active 
fragments of the isolated polypeptides disclosed herein. 

30 
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The terms "fragment," "derivative" and "analog" when referring to the 
polypeptides disclosed herein means a polypeptide which retains essentially the 
same biological function or activity as such polypeptide. Thus, an analog 
includes a proprotein, or preprotein, which can be activated by cleavage of the 
proprotein portion to produce an active mature polypeptide. Such fragments, 
derivatives and analogs must have sufficient similarity to the polypeptide of 
SEQ ID N0:2, 4 or 6 so that immunogenic activity of the native polypeptide is 
retained. 

The polypeptide of the present invention may be a recombinant 
polypeptide, a natural polypeptide or a synthetic polypeptide, preferably a 
recombinant polypeptide. 

The fragment, derivative or analog of the polypeptide of SEQ ID N0:2, 
4, or 6 may be (i) one in which one or more of the amino acid residues are 
substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or 
may not be one encoded by the genetic code, or (ii) one in which one or more 
of the amino acid residues includes a substituent group, or (iii) one in which 
the mature polypeptide is fused with another compound, such as a compound 
to increase the half-life of the polypeptide (for example, polyethylene glycol), or 
(iv) one in which the additional amino acids are fused to the mature 
polypeptide, such as a leader or secretory sequence or a sequence which is 
employed for purification of the mature polypeptide or a proprotein sequence. 
Such fragments, derivatives and analogs are deemed to be within the scope of 
those skilled in the art from the teachings herein. 

As known in the art "similarity" between two polypeptides is determined 
by comparing the amino acid sequence and its conserved amino acid 
substitutes of one polypeptide to the sequence of a second polypeptide. 
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Fragments or portions of the polypeptides of the present invention may 
be employed for producing the corresponding full-length polypeptide by peptide 
synthesis; therefore, the fragments may be employed as intermediates for 
producing the full-length polypeptides. Fragments or portions of the 
polynucleotides of the present invention may be used to synthesize full-length 
polynucleotides of the present invention. 

As used herein with reference to polypeptides, the terms "portion," 
"segment," and "fragment," refer also to a continuous sequence of residues, 
such as amino acid residues, which sequence forms a subset of a larger 
sequence. For example, if a polypeptide were subjected to treatment with any 
of the common endopeptidases, such as trypsin, chymotrypsin, or papain, the 
oligopeptides resulting from such treatment would represent portions, 
segments or fragments of the starting polypeptide. 

The present invention is also directed to isolated polynucleotides whose 
sequences contain coding regions encoding the polypeptides of the present 
invention, preferably the polypeptides of SEQ ID NO: 2, 4, and 6 and most 
preferably will be the isolated polynucleotides comprising the sequences of 
SEQ ID NOS: 1, 3, and 5, 

The present invention is also directed to fragments or portions of such 
sequences which contain at least 15 bases, preferably at least 30 bases, 
more preferably at least 50 bases and most preferably at least 80 bases, and 
to those sequences which are at least 60%, preferably at least 80%, and 
most preferably at least 95%, especially 98%, identical thereto, and to DNA 
(or RNA) sequences encoding the same polypeptide as the sequences of SEQ 
ID NOS: 2, 4, and 6, including fragments and portions thereof and, when 
derived from natural sources, includes alleles thereof. 
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Yet another aspect of the present invention is directed to an isolated 
DNA (or RNA) sequence or molecule comprising at least the coding region of 
a bacterial gene (or a DNA sequence encoding the same polypeptide as such 
coding region), in particular an expressed bacterial gene, which bacterial 

5 gene comprises a DNA sequence homologous with, or contributing to, the 
sequence depicted in SEQ ID NOS: 1, 3, and 5 or one at least 60%, 
preferably at least 80%, and most preferably at least 95%, especially 98%, 
identical thereto, including 100% identity, as well as fragments or portions 
of the coding region which encode a polypeptide having a similar function to 

10 the polypeptide encoded by said coding region. Thus, the isolated DNA (or 
RNA) sequence may include only the coding region of the expressed gene (or 
fragment or portion thereof as hereinabove indicated) or may further include 
all or a portion of the non-coding DNA (or RNA) of the expressed bacterial 
gene. 

15 

In general, sequences homologous with and contributing to the 
sequences of SEQ ID NOS: 1, 3, and 5 (or one at least 60%, preferably at 
least 80%, and most preferably at least 95% identical or homologous 
thereto) are from the coding region of a bacterial gene. 

20 

The polynucleotides according to the present invention may also occur 
in the form of mixtures of polynucleotides hybridizable to some extent with the 
gene sequences containing any of the nucleotide sequences of SEQ ID NOS: 
1 , 3, and 5, including any and all fragments thereof, and which polynucleotide 
25 mixtures may be composed of any number of such polynucleotides, or 
fragments thereof, including mixtures having at least 10, perhaps at least 30 
such sequences, or fragments thereof. 

Fragments of the full length polynucleotide of the present invention may 
30 be used as hybridization probes for a DNA library to isolate the full length DNA 

14 



and to isolate other DNAs which have a high sequence similarity to the gene or 
similar biological activity. Probes of this type preferably have at least 1 5 bases, 
may have at least 30 bases and even 50 or more bases. The probe may also 
be used to identify a DNA clone corresponding to a full length transcript and a 

5 genomic clone or clones that contain the complete gene including regulatory 
and promoter regions. An example of a screen comprises isolating the coding 
region of the gene by using the known DNA sequence to synthesize an 
oligonucleotide probe. Labeled oligonucleotides having a sequence 
complementary to that of the gene of the present invention are used to screen 

10 a library of DNA or mRNA to determine which members of the library the probe 
hybridizes to. 

The present invention is also directed to vectors comprising the 
polynucleotides disclosed herein, as well as to genetically engineered cells 
15 comprising such vectors and/or polynucleotides. Thus, the present invention 
also relates to vectors which include polynucleotides of the present invention, 
host cells which are genetically engineered with vectors of the Invention and 
the production of polypeptides of the invention by recombinant techniques. 

20 Host cells are genetically engineered (transduced or transformed or 

transfected) with the vectors of this invention which may be, for example, a 
cloning vector or an expression vector. The vector may be, for example, in the 
form of a plasmid, a viral particle, a phage, etc. The engineered host cells can 
be cultured in conventional nutrient media modified as appropriate for 

25 activating promoters, selecting transformants or amplifying the genes of the 
present invention. The culture conditions, such as temperature, pH and the 
like, are those previously used with the host cell selected for expression, and 
will be apparent to the ordinarily skilled artisan. 
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The polynucleotides of the present invention may be employed for 
producing polypeptides by recombinant techniques. Thus, for example, the 
polynucleotide may be included in any one of a variety of expression vectors 
for expressing a polypeptide. Such vectors include chromosomal, 
nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40; 
bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors derived 
from combinations of plasmids and phage DNA, viral DNA such as vaccinia, 
adenovirus, fowl pox virus, and pseudorabies. However, any other vector may 
be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a 
variety of procedures. In general, the DNA sequence is inserted into an 
appropriate restriction endonuclease site(s) by procedures known in the art. 
Such procedures and others are deemed to be within the scope of those skilled 
in the art. 

The DNA sequence in the expression vector is operatively linked to an 
appropriate expression control sequence(s) (promoter) to direct mRNA 
synthesis. As representative examples of such promoters, there may be 
mentioned: LTR or SV40 promoter, the E, coli. lac or trp, the phage lambda Pl 
promoter and other promoters known to control expression of genes in 
prokaryotic or eukaryotic cells or their viruses. The expression vector also 
contains a ribosome binding site for translation initiation and a transcription 
terminator. The vector may also include appropriate sequences for amplifying 
expression. 

In addition, the expression vectors preferably contain one or more 
selectable marker genes to provide a phenotypic trait for selection of 
transformed host cells such as dihydrofolate reductase or neomycin resistance 
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for eukaryotic cell culture, or such as tetracycline or ampicillin resistance in E 
coli. 

The vector containing the appropriate DNA sequence as hereinabove 
5 described, as well as an appropriate promoter or control sequence, may be 
employed to transform an appropriate host to permit the host to express the 
protein. 

As representative examples of appropriate hosts, there may be 
10 mentioned: bacterial cells, such as E, coli , Streptomyces , Salmonella 
typhimurium ; fungal cells, such as yeast; insect cells such as Drosophila S2 
and Spodoptera Sf9 ; animal cells such as CHO, COS or Bowes melanoma; 
adenoviruses; plant cells, etc. The selection of an appropriate host is deemed 
to be within the scope of those skilled in the art from the teachings herein. 

15 

More particularly, the present invention also includes recombinant 
constructs comprising one or more of the sequences as broadly described 
above. The constructs comprise a vector, such as a plasmid or viral vector, 
into which a sequence of the invention has been inserted, in a forward or 

20 reverse orientation. In a preferred aspect of this embodiment, the construct 
further comprises regulatory sequences, including, for example, a promoter, 
operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially 
available. The following vectors are provided by way of example; Bacterial: 

25 pQE70, pQE60, pQE-9 (Qiagen), pBS, pDIO, phagescript, phiX174, pBluescript 
SK, pBSKS, pNHSA, pNH16a, pNHISA, pNH46A (Stratagene); pTRC99a, 
pKK223-3, PKK233-3, pDR540, pRIT5 (Pharmacia); Eukaryotic: pWLNEO, 
pSV2CAT, pOG44, pXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). However, any other plasmid or vector may be used as long as 

30 they are replicable and viable in the host. 
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Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. 
Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
promoters include lad, lacZ, T3, T7, gpt, lambda Pr, Pl and trp. Eukaryotic 
promoters include CMV immediate early, HSV thymidine kinase, early and late 
SV40, LTRs from retrovirus, and mouse metallothionein-l. Selection of the 
appropriate vector and promoter is well within the level of ordinary skill in the 
art. 

In a further embodiment, the present invention relates to host cells 
containing the above-described constructs. The host cell can be a higher 
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a 
yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. 
Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-Dextran mediated transfection, or 
electroporation (Davis, L., Dibner, M., Battey, I., Basic Methods in Molecular 
Biology, (1986)). . 

The constructs in host cells can be used in a conventional manner to 
produce the gene product encoded by the recombinant sequence. 
Alternatively, the polypeptides of the invention can be synthetically produced 
by conventional peptide synthesizers. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or 
other cells under the control of appropriate promoters. Cell-free translation 
systems can also be employed to produce such proteins using RNAs derived 
from the DNA constructs of the present invention. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrook, et aL, Molecular Cloning: A Laboratory Manual, Second Edition, 
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Cold Spring Harbor, N.Y., (1989), the disclosure of which is hereby 
incorporated by reference. 

Transcription of the DNA encoding the polypeptides of the present 
5 invention by higher eukaryotes is increased by inserting an enhancer sequence 
into the vector. Enhancers are cis-acting elements of DNA, usually about from 
10 to 300 bp that act on a promoter to increase its transcription. Examples 
include the SV40 enhancer on the late side of the replication origin bp 100 to 
270, a cytomegalovirus early promoter enhancer, the polyoma enhancer on the 
10 late side of the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include origins of 
replication and selectable markers permitting transformation of the host cell, 
e.g., the ampicillin resistance gene of E. coli and S. cerevisiae Trpi gene, and a 

15 promoter derived from a highly-expressed gene to direct transcription of a 
downstream structural sequence. Such promoters can be derived from 
operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase 
(PGK), a-factor, acid phosphatase, or heat shock proteins, among others. The 
heterologous structural sequence is assembled in appropriate phase with 

20 translation initiation and termination sequences, and preferably, a leader 
sequence capable of directing secretion of translated protein into the 
periplasmic space or extracellular medium. Optionally, the heterologous 
sequence can encode a fusion protein including an N-terminal identification 
peptide imparting desired characteristics, e.g., stabilization or simplified 

25 purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a 
structural DNA sequence encoding a desired protein together with suitable 
translation initiation and termination signals in operable reading phase with a 
30 functional promoter. The vector will comprise one or more phenotypic 
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selectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host. Suitable 
prokaryotic hosts for transformation include E. coli . Bacillus subtilis . Salmonella 
typhimurium and various species within the genera Pseudomonas, 
Streptomyces, and Staphylococcus, although others may also be employed as 
a matter of choice. 

As a representative but nonlimiting example, useful expression vectors 
for bacterial use can comprise a selectable marker and bacterial origin of 
replication derived from commercially available plasmids comprising genetic 
elements of the well known cloning vector pBR322 (ATCC 37017), Such 
commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, Wl, USA). 
These pBR322 "backbone" sections are combined with an appropriate 
promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host 
strain to an appropriate cell density, the selected promoter is induced by 
appropriate means (e.g., temperature shift or chemical induction) and cells are 
cultured for an additional period. 

Cells are typically harvested by centrifugation, disrupted by physical or 
chemical means, and the resulting crude extract retained for further 
purification. 

Microbial cells employed in expression of proteins can be disrupted by 
any convenient method, including freeze-thaw cycling, sonication, mechanical 
disruption, or use of cell lysing agents, such methods are well known to those 
skilled in the art. 



20 



Various mammalian cell culture systems can also be employed to 
express recombinant protein. Examples of mammalian expression systems 
include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, 
Cell, 23:175 (1981), and other cell lines capable of expressing a compatible 

5 vector, for example, the CI 27, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors will comprise an origin of replication, a suitable 
promoter and enhancer, and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, transcriptional 
termination sequences, and 5' flanking nontranscribed sequences. DNA 

10 sequences derived from the SV40 splice, and polyadenylation sites may be 
used to provide the required nontranscribed genetic elements. 

The polypeptide can be recovered and purified from recombinant cell 
cultures by methods including ammonium sulfate or ethanol precipitation, acid 

15 extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. 
Protein refolding steps can be used, as necessary, in completing configuration 
of the mature protein. Finally, high performance liquid chromatography (HPLC) 

20 can be employed for final purification steps. 

The polypeptides of the present invention may be a naturally purified 
product, or a product of chemical synthetic procedures, or produced by 
recombinant techniques from a prokaryotic or eukaryotic host (for example, by 
25 bacterial, yeast, higher plant, insect and mammalian cells in culture). 
Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non- 
glycosylated. Polypeptides of the invention may also include an initial 
methionine amino acid residue. 

30 
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The polypeptides of the present invention, when utilized for clinically 
related purposes, may also be suspended in a pharmacologically acceptable 
diluent or excipient to facilitate such uses, which will include use as a 
vaccine for the purpose of preventing a wide variety of streptococcal and 
staphylococcal infections. 

In accordance with another aspect of the present invention, there is 
provided a vaccine that includes at least one polypeptide that is at least 75% 
identical, preferably at least 90% identical and most preferably 95% identical, 
to a polypeptide sequence comprising the sequence of SEQ ID NO: 2, 4, or 6. 
Such variations in homology for putative vaccines are well known in the art 
(See, for example, Hanson et al., "Active and Passive immunity Against 
Borrelia burgdorferi Decorin Binding Protein A (DbpA)," Infection and Immunity , 
(May) 1998, p. 2143 - 2153; Roberts et al., "Heterogeneity Among Genes 
Including Decorin Binding Proteins A and B of Borrelia burgdorferi sensu lato" 
Infection and Immunity , (Nov) 1998, p. 5275-5285). Such observations would 
similarly apply to portions, segments or fragments of the polypeptides 
disclosed herein. 

Such segments find a multitude of uses. For example, such segments of 
the polypeptides according to the present invention find use as intermediates in 
the synthesis of higher molecular weight structures also within the present 
invention. 

The term "active fragment" means a fragment that generates an 
immune response (i.e., has immunogenic activity) when administered, alone or 
optionally with a suitable adjuvant, to an animal, such as a mammal, for 
example, a rabbit or a mouse, and also including a human. 
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In accordance with a further aspect of the invention, a vaccine of the 
type hereinabove described is administered for the purpose of preventing or 
treating infection caused by streptococci and staphylococci as well as nriany 
related organisms. 

A vaccine in accordance with the present invention may include one or 
more of the hereinabove described polypeptides or active fragments thereof. 
When employing more than one polypeptide or active fragment, such as two or 
more polypeptides and/or active fragments may be used as a physical mixture 
or as a fusion of two or more polypeptides or active fragments. The fusion 
fragment or fusion polypeptide may be produced, for example, by recombinant 
techniques or by the use of appropriate linkers for fusing previously prepared 
polypeptides or active fragments. 

In many cases, the variation in the polypeptide or active fragment is a 
conservative amino acid substitution, although other substitutions are within 
the scope of the invention. 

In accordance with the present invention, a polypeptide variant includes 
variants in which one or more amino acids are substituted and/or deleted 
and/or inserted. 

In another aspect, the invention relates to passive immunity vaccines 
formulated from antibodies against a polypeptide or active fragment of a 
polypeptide of the present invention. Such passive immunity vaccines can be 
utilized to prevent and/or treat streptococcal and staphylococcal infections in 
patients. In this manner, according to a further aspect of the invention, a 
vaccine can be produced from a synthetic or recombinant polypeptide of the 
present invention or an antibody against such polypeptide. 
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Still another aspect the present invention relates to a method of using 
one or more antibodies (monoclonal, polyclonal or sera) to the polypeptides of 
the invention as described above for the prophylaxis and/or treatment of 
diseases that are caused by streptococcal and staphylococcal bacteria. In 
particular, the invention relates to a method for the prophylaxis and/or 
treatment of infectious diseases that are caused by streptococci and 
staphylococci. In a still further preferred aspect, the invention relates to a 
method for the prophylaxis and/or treatment of such diseases as necrotizing 
fasciitis, scarlet fever, sepsis and many diseases of newborns, in humans by 
utilizing a vaccine of the present invention. 

Generally, vaccines are prepared as injectables, in the form of aqueous 
solutions or suspensions. Vaccines in an oil base are also well known such as 
for inhaling. Solid forms which ate dissolved or suspended prior to use may 
also be formulated. Pharmaceutical carriers, diluents and excipients are 
generally added that are compatible with the active ingredients and acceptable 
for pharmaceutical use. Examples of such carriers include, but are not limited 
to, water, saline solutions, dextrose, or glycerol. Combinations of carriers may 
also be used. 

Vaccine compositions may further incorporate additional substances to 
stabilize pH, or to function as adjuvants, wetting agents, or emulsifying agents, 
which can serve to improve the effectiveness of the vaccine. 

Vaccines are generally formulated for parenteral administration and are 
injected either subcutaneously or intramuscularly. Such vaccines can also be 
formulated as suppositories or for oral administration, using methods known in 
the art, or for administration through nasal or respiratory routes. 
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The amount of vaccine sufficient to confer immunity to pathogenic 
bacteria is determined by methods well known to those skilled in the art. This 
quantity will be determined based upon the characteristics of the vaccine 
recipient and the level of immunity required. Typically, the amount of vaccine 
5 to be administered will be determined based upon the judgment of a skilled 
physician. Where vaccines are administered by subcutaneous or intramuscular 
injection, a range of 0.5 to 500 |iig purified protein may be given. 

The present invention is also directed to a vaccine in which a 
10 polypeptide or active fragment of the present invention is delivered or 
administered in the form of a polynucleotide encoding the polypeptide or active 
fragment, whereby the polypeptide or active fragment is produced in vivo. The 
polynucleotide may be included in a suitable expression vector and combined 
with a pharmaceutically acceptable carrier, 

15 

Thus, the present invention expressly contemplates a vaccine 
composition comprising any of the polypeptides disclosed herein, said 
polypeptide being present in an amount effective to produce an immune 
response, and wherein said polypeptide is suspended in a pharmacologically 
20 acceptable carrier, diluent or excipient. 

The vaccine compositions of the present invention may also comprise 
live vaccines, containing such organisms as Steptococcus gordoniae and 
Salmonella typhi, wherein said organisms contain recombinant polypeptides as 
25 disclosed herein. 

In addition, the polypeptides of the present invention can be used as 
immunogens to stimulate the production of antibodies for use in passive 
immunotherapy, for use as diagnostic reagents, and for use as reagents in 
30 other processes such as affinity chromatography. 
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Thus, the present invention is also directed to methods for the 
prevention of a wide variety of diseases caused by streptococcal and 
staphylococcal organisms, said methods involving the administering of 
5 vaccines disclosed herein to animals at risk of such diseases, especially 
where said animals are humans. 
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In addition, the invention disclosed herein is also directed to a means 
of treating animals, especially humans, afflicted with a disease caused by the 
10 organisms from which the isolated polypeptides of the invention are derived, 
such methods including, but not being limited to, administering to an animal, 
especially a human, afflicted with such a disease of a therapeutically 
effective amount of an antibody, or mixture of antibodies, against the 
polypeptides disclosed herein. 

15 

Antibodies specific for the polypeptides disclosed herein may be 
either polyclonal or monoclonal and may even be in the form of antisera. 
When such antibodies are monoclonal in nature, they may be produced by 
conventional methods of preparing monoclonal antibodies, such as from 

20 conventional hybridoma cells, and may also be produced by genetically 
engineered cells transformed with vectors containing genes specifically 
coding for the different heavy and light chains of antibody molecules having 
an arrangement of variable regions specifically complementary to one or 
more of the polypeptides of the invention. Such recombinantly produced 

25 antibodies may be in the form of either dimers or tetramers, depending on 
the type of cellular expression system utilized therefor. 



The invention will now be further described in more detail in the 
following non-limiting examples and it will be appreciated that additional and 
30 different embodiments of the teachings of the present invention will 
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doubtless suggest themselves to those of skill in the art and such other 
embodiments are considered to have been inferred from the disclosure 
herein. 



Example 1 

Southern Blot Analysis of Chromosomal DNA Using Probes Specific for the 
Sp36 Gene of Streptococcus pneumoniae 

Genomic DNA was isolated from Staphylococcus aureus, Streptococcus 
pyogenes (group A), and Streptococcus agalactiae (group B) after overnight 
growth of the bacteria. The DNA was digested to completion by overnight 
incubation with restriction enzymes (BamVW and PvuW), and then DNA 
fragments were resolved by size by agarose gel electrophoresis before transfer 
to a nylon membrane. The membrane was then probed with DNA encoding the 
entire Sp36 open reading frame that had been fluorescein-labeled with random 
primers using a kit from Amersham Pharmacia Biotech Inc. The hybridization 
and washes were carried out under low stringency conditions (i.e., 45°C, 
5xSSC hybridization; 45^0, IxSSC for V wash; 45°C, 0.5xSSC for 2"^ wash). 
Here, SSC is composed of 150 mM NaCI and 15 nM sodium citrate, pH 7.0 
and all washes are 50 mL each. 

After hybridization and washing was complete, the bound, fluorescein- 
labeled probe was detected using an anti-fluorescein antibody as per the 
manufacturer's instructions with the kit. Similarly digested DNA from 
Streptococcus pneumoniae strain SJ2 (serotype 6B) was used as a positive 
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control. Fluorescein-labeled bacteriophage lambda DNA digested with the 
restriction nuclease HindlW was used as a size marker. 

The Sp36 probe hybridized with a single fragment in the digested S. 
5 aureus DNA {-'4.5 kb BamH\ fragment, -5 kb PvuW fragment) and with 2 major 
fragments in a PvuW digest of serotype Ml of the group A streptococci 
genomic DNA (-4.0 kb. and -4.2 kb ). 

Example 2 

10 

BLAST Analysis Using Sp36 Predicted Amino Acid Sequence 

Sequence comparisons of the Sp36 encoded protein sequence against 
J= the publicly available GenBank sequence database (including the unfinished 

!^ 15 microbial database 
W (http://www.ncbi.nlm.nih.gov/BLAST/unfinishedgenome.html)) revealed two 

Q highly homologous amino acid sequences. One of these was a predicted amino 

f'J acid sequence from the S. pyogenes genome. This predicted polypeptide 

m comprised 825 amino acid residues (MW = 92,616 Da) that was 25.1% 

O 20 identical to the Sp36 amino acid sequence from pneumococcus serotype 4 but 

maintained the 5 histidine triads (underlined in Figure 5(a) - SEQ ID NO: 2). The 
second polypeptide encoded within the S. pyogenes database contained 
several errors that were corrected by our sequencing of this region of the 
genome. The DNA fragment obtained encoded a protein of 792 amino acids 
25 (MW = 87,457 Da) that was 12.6% identical to the pneumococcal sequence 
and 12,5% identical to the first S. pyogenes polypeptide. This predicted amino 
acid sequence contained four histidine triad motifs (underlined in Fig. 5(b) - 
SEQ ID NO.: 4). The third polypeptide sequence obtained was one already in 
the database (Accession No. AF062533) and identified only as an unknown 
30 gene downstream from a gene identified as /mb in S. galactiae. This 822 
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amino acid protein thus has a predicted molecular weight of 92,353 Da and 
maintains the 5 histidine triad motifs (underlined in Figure 5(C) - SEQ ID NO: 
6). This second polypeptide shows 25.6% sequence identity to Sp36 of 
pneumococcus type 4 and 97.7% and 11.6% identity to the two group A 
5 homologs, respectively. 



Example 3 

10 

Southern Blot Analysis Using a group A Streptococcal Sp36 Homolog Probe 

W Southern blot analysis was performed with a fluorescein-labeled DNA 

!ji fragment as probe, which encoding a group A streptococcal Sp36 homolog 

fi 15 cloned from an Ml serotype of the group A streptococcal genome. This 
^ fragment was then used to probe genomic DNA from an M6 serotype of the 

O group A streptococcal genome, as well as serotype 1 a and serotype 3 of the 

m group B streptococcal genome, and strain SJ2 (serotype 6B) of 

^i} pneumococcus. In all cases, a single band was obtained in DNA digested with 

Q 20 BamHl when hybridization was carried out under low stringency conditions (as 

described above). A band of about 20 kb was visualized in group A 
streptococcal DNA, about 4.5 kb was obtained for group B streptococcal DNA, 
and a band of about 4kb was seen for pneumococcus. 

25 

Example 4 

Western Blot Analysis of Reactivity of group B Streptococcal Homolog With 
30 Anti -Pneumococcal Sp36 Antiserum 
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To determine whether antiserum raised against recombinant Sp36 from 
S. pneumoniae would recognize the recombinant Sp36 homolog encoded by 
group B streptococcal organisms, a western blot was performed. One hundred 

5 nanograms {100 ng) of recombinant Sp36 polypeptide cloned from either S, 
pneumoniae serotype 4, or of the Sp36 homolog cloned from group B 
streptococcal organisms, or from an unrelated recombinant protein control 
expressed and purified in the same way, were subjected to SDS-PAGE 
containing 12% acrylamide. A cell lysate of pneumococcal strain SJ2 (serotype 

10 6B) was also included on the gel. After electrophoresis, the separated proteins 
were transferred to a nitrocellulose membrane and probed with rabbit 
polyclonal antiserum raised against the recombinant pneumococcal protein. 
Bound antibodies were detected chemiluminescently with a goat anti-rabbit IgG 
antibody conjugated to horseradish peroxidase using the substrate ECL (from 

15 Amersham). The results demonstrate that antiserum raised against the 
pneumococcal Sp36 protein cross-react with the Sp36 homolog identified from 
the group B streptococci and thereby indicating conservation of epitopes 
between the proteins. The group B streptococcal homolog is also 
approximately the same size as the protein detected in S. pneumoniae lysates. 

20 Because the group A and B homologs are highly homologous, if not identical, 
such antiserum would also likely cross-react with the group A streptococcal 
protein. 

25 Example 5 

Alignment of Predicted Amino Acid Sequences of the Sp36 Homologs from 
group A and B Streptococci With Pneumococcal Sp36 
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The predicted amino acid sequences from the Sp36 genes from group A 
and group B streptococci and S. pneumoniae were aligned using the Clustal 
algorithm in a DNAStar Computer package (DNAStar, Inc., Madison, Wl). 
Amino acids that match those encoded by the pneumococcal gene are boxed 
5 in Figure 2 (showing the results of the alignment). Gaps introduced in the 
sequence by the alignment process are indicated by dashed lines. 



10 Example 6 

3 Percentage Sequence Identity Between Homologs of Sp36 

The Sp36 amino acid sequence from pneumococci is 25.6% identical to 

* 

W the predicted amino acid sequence of the homologous gene of group B 

2 15 streptococci and 25.1 % and 1 2.6% identical to the deduced sequences of the 

1^ *IU 

two genes from group A streptococci. Furthermore, the group B homolog is 
O 97.7% and 11.6% identical to the first {GAS36) and second (GAS36(2)) 

fU homologs from group A streptococci, respectively. These experiments indicate 

!i{ that homologous genes to Sp36 from pneumococcus are present in group A 

20 and group B streptococci, as well as in Staphylococcus aureus. The protein 
encoded by this gene may therefore perform a similar function in these 
different organisms. This suggests that a vaccine comprising one or more of 
these proteins may be broadly protective against these species. These results 
are summarized in Table 1 which shows the percent identity between the 
25 amino acid sequences of Sp36 from pneumococcus strain Norway 4 
(serotype 4), group A streptococci Sp36 homolog from an Ml serotype, and 
group B streptococci Sp36 from strain R268. 
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Table 1 . 



Pneumo. Sp36 GAS36 GAS36(2) GBS36 



12.6% 25.6% 

97.7% 
100% 11.6% 

100% 

where GAS36 = SEQ ID NO: 2 

GAS36(2) = SEQ ID NO: 4 
GBS36 = SEQ ID NO: 6 



Pneumo. Sp36 100% 25.1% 

GAS36 — 100% 
GAS36{2) 
GBS36 
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WHAT IS CLAIMED IS 



1 . An isolated polypeptide comprising an amino acid sequence at least 
75% identical to a sequence selected from the group consisting of SEQ ID NO: 
2, 4 and 6. 

2. The isolated polypeptide of claim 1 wherein said polypeptide is at 
least 90% identical to the sequence selected from the group consisting of SEQ 
ID NO: 2, 4, and 6. 

3. The isolated polypeptide of claim 1 wherein said polypeptide is at 
least 95% identical to the sequence selected from the group consisting of SEQ 
ID NO: 2, 4, and 6. 

4. The isolated polypeptide of claim 1 wherein said polypeptide has the 
amino acid sequence selected from the group consisting of SEQ ID NO: 2, 4 
and 6. 

5. The isolated polypeptide of claim 1 wherein said polypeptide is found 
in an organism selected from the group consisting of group A streptococci, 
group B streptococci, and Staphylococcus aureus, 

6. The isolated polypeptide of claim 5 wherein the group A 
streptococcal organism is Streptococcus pyogenes, 

7. The isolated polypeptide of claim 5 wherein the group B streptococcal 
organism is Streptococcus agalactiae. 
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8. The isolated polypeptide of claim 1 wherein said polypeptide has a 
sequence at least 25% identical to the amino acid sequence of the Sp36 
protein of Streptococcus pneumoniae. 

9. An isolated polynucleotide comprising a sequence coding for a 
polypeptide selected from the group consisting of the polypeptides of claims 1 , 
2, 3, 4, 5, 6, 7, and 8. 

10. The isolated polynucleotide of claim 9 wherein said polynucleotide 
has a nucleotide sequence selected from the group consisting of SEQ ID NO: 
1 , 3 and 5. 

11. An antibody specific for a polypeptide selected from the group 
consisting of the polypeptides of claims 1 , 2, 3, 4, 5, 6, 7, and 8. 

1 2. The antibody of claim 1 1 wherein said antibody is a monoclonal 
antibody. 

1 3. A genetically engineered cell producing the antibody of claim 1 2. 

14. A vector comprising the polynucleotide of claim 9. 

1 5. A vector comprising the polynucleotide of claim 1 0. 

16. A genetically engineered cell expressing the polypeptide coded for 
by the polynucleotide of claim 9 or 10. 

17. A composition comprising a polypeptide selected from the group 
consisting of the polypeptides of claims 1, 2, 3, 4, 5, 6, 7, and 8, said 
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polypeptide being suspended in a pharmacologically acceptable diluent or 
excipient. 

18. A vaccine composition comprising a polypeptide selected from the 
group consisting of the polypeptide of claims 1, 2, 3, 4, 5, 6, 7, and 8, said 
polypeptide being present in an amount effective to produce an immune 
response, and wherein said polypeptide is suspended in a pharmacologically 
acceptable carrier, diluent or excipient. 

19. A vaccine comprising an immunogenically active amount of the 
composition of claim 17, 

20. A method of vaccinating an animal against infection by a bacterial 
organism selected from the group consisting of streptococcal bacteria and 
staphylococcal bacteria comprising administering to said animal an 
immunologically effective amount of the vaccine of claim 19. 

21 . The method of claim 20 wherein said animal is a human. 

22. A method of treating a disease comprising administering to an 
animal afflicted therewith of a therapeutically effective amount of an antibody 
of claim 12 wherein said antibody is suspended in a pharmacologically 
acceptable carrier, diluent or excipient. 

23. The method of claim 22 wherein said animal is a human. 

24. The method of claim 22 wherein said disease is caused by an 
organism selected from the group consisting of group A streptococci, group B 
streptococci, and Staphylococcus aureus. 
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ABSTRACT 



The invention is directed to isolated polypeptides bearing sequence 
homology to the Sp36 protein found in pneumococcal organisms, such as 
Streptococcus pneumoniae. Polynucleotides encoding such polypeptides are 
also disclosed. The Invention also relates to antibodies specific for the 
disclosed polypeptides and to uses of such antibodies in the treatment of 
diseases caused by staphylococci as well as group A and B streptococci. In 
addition, the invention relates to the use of the disclosed polypeptides in 
compositions and as vaccines and for prophylactic uses such as in vaccination 
of animals, especially humans, against a wide variety of streptococcal, 
staphylococcal and other diseases. 
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Figure 1 
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Figure 2 (a) 
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Figure 2 (b) 
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Figure 2 (c) 
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Figure 3 
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Figure 4 
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<110> Heinrichs^ Jon 

Johnson, Leslie S, 
Koenig, Scott 
Adamou, John E, 



<120> Pneumococcal Protein Homo logs and Fragments for 
Vaccines 



<130> 469201-402 



<140> 
<141> 



<150> U.S. 60/150,750 
<151> 1999-08-25 



<160> 6 



<170> Patentin Ver. 2.1 



<210> 1 

<211> 2478 

<212> DNA 

<213> Streptococcus pyogenes 



<400> 1 

gtgaagaaaa catatggtta tatcggctca gttgctgcta ttttactagc tactcatatt 60 
ggaagttacc aacttggtaa gcatcatatg ggttcagcaa caaaggacaa tcaaattgcc 12 0 
tatattgatg atagcaaagg taaggcaaaa gcccctaaaa caaacaaaac gatggatcaa 18 0 
atcagtgctg aagaaggcat ctctgctgaa cagatcgtag tcaaaattac tgaccaaggc 24 0 
tatgtgacct cacatggtga ccattatcat ttttacaatg ggaaagttcc ttatgatgcg 3 00 
attattagtg aagagttgtt gatgacggat cctaattacc gttttaaaca atcagacgtt 3 60 
atcaatgaaa tcttagacgg ttacgttatt aaagtcaatg gcaactatta tgtttacctc 420 
aagccaggta gcaagcgcaa aaacattcga accaaacaac aaattgctga gcaagtagcc 480 
aaaggaacta aagaagctaa agaaaaaggt ttagctcaag tggcccatct cagtaaagaa 54 0 
gaagttgcgg cagtcaatga agcaaaaaga caaggacgct atactacaga cgatggctat 600 
atttttagtc cgacagatat cattgatgat ttaggagatg cttatttagt acctcatggt 660 
aatcactatc attatattcc taaaaaggat ttgtctccaa gtgagctagc tgctgcacaa 720 
gcctactgga gtcaaaaaca aggtcgaggt gctagaccgt ctgattaccg cccgacacca 78 0 
gccccagccc caggtcgtag gaaagcccca attcctgatg tgacgcctaa ccctggacaa 84 0 
ggt cat cage cagataacgg tggctatcat ccagcgcctc ctaggccaaa tgatgcgtca 90 0 
caaaacaaac accaaagaga tgagtttaaa ggaaaaacct ttaaggaact tttagatcaa 96 0 
ctacaccgtc ttgatttgaa ataccgtcat gtggaagaag atgggttgat ttttgaaccg 102 0 
actcaagtga tcaaatcaaa cgcttttggg tatgtggtgc ctcatggaga tcattatcat 108 0 
attatcccaa gaagtcagtt atcacctctt gaaatggaat tagcagatcg atacttagcc 1140 
ggccaaactg aggacgatga ctcaggttca gatcactcaa aaccatcaga taaagaagtg 1200 



acacatacct ttcttggtca tcgcatcaaa 
tatgatacga gtgatgctta tgtttttagt 
ggagttacag ctaaacacgg agatcatttc 
tatgagttgg atgaggtcgc taactgggtg 
gctgctttgg atcaggaaca aggcaaagaa 
cgcaaagtaa caaaagatgg taaagtgggc 
ttctatgctc gtgatcaact tgatttgact 
cttaaagata agaaacatta ccgttatgac 
gctgtagatg tgtcaagtct gccgatgcat 
tcgtttgtta tccctcatat tgatcatatc 
gatcagattg caacaatcaa gtatgtgatg 
tctaagccag ggcatgaaga gtcaggttcg 
cgtgctggta tgccaaactg gcaaattatc 
gcagaaggtc gttttgcaac accagacggc 
aaagaaactt ttgtatggaa agatggctcc 
ttgagaacca ttaataaatc t gate tat cc 
ttggcaaaga aaaacgctgg tgatgctact 
gcagataaga gcaatgaaaa ccaacagcca 
tcagatgact ttatagacag tttaccagac 
catatcaatc aattagcaca aaaagctaat 
gaaggtgtcc aattttataa taaaaatggt 
caacaaataa acccttaa 



gcttacggaa aaggcttaga tggtaaacca 1260 
aaagaatcca ttcattcagt ggataaatca 13 2 0 
cactatatag gatttggaga acttgaacaa 138 0 
aaagcaaaag gtcaagctga tgagcttgct 1440 
aaaccactct ttgacactaa aaaagtgagt 1500 
tat at gat gc caaaagatgg caaggactat 1560 
cagattgcct ttgccgaaca agaactaatg 1620 
attgttgaca caggtattga gccacgactt 168 0 
gctggtaatg ctacttacga tactggaagt 1740 
catgtcgttc cgtattcatg gttgacgcgc 1800 
caacaccccg aagttcgtcc ggatatatgg 1860 
gtcattccaa atgttacgcc tcttgataaa 1920 
cattctgctg aagaagttca aaaagcccta 1980 
tatattttcg atccacgaga tgttttggcc 2040 
tttagcatcc caagagcaga tggcagttca 2100 
caagctgagt ggcaacaagc tcaagagtta 2160 
gatacggata aacccaaaga aaagcaacag 2220 
agtgaagcca gtaaagaaga agaaaaagaa 2280 
tatggtctag atagagcaac cctagaagat 2340 
atcgatccta agtatctcat tttccaacca 2400 
gaattggtaa cttatgatat caagacactt 2460 

2478 



<210> 2 
<211> 825 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 2 

Val Lys Lys Thr Tyr Gly Tyr lie Gly Ser Val Ala Ala lie Leu Leu 
15 10 15 

Ala Thr His lie Gly Ser Tyr Gin Leu Gly Lys His His Met Gly Ser 

20 25 30 

Ala Thr Lys Asp Asn Gin lie Ala Tyr lie Asp Asp Ser Lys Gly Lys 
35 40 45 

Ala Lys Ala Pro Lys Thr Asn Lys Thr Met Asp Gin lie Ser Ala Glu 
50 55 60 

Glu Gly He Ser Ala Glu Gin He Val Val Lys He Thr Asp Gin Gly 
65 70 75 80 



Tyr Val Thr Ser His Gly Asp His Tyr His Phe Tyr Asn Gly Lys Val 

85 90 95 



Pro Tyr Asp Ala 

100 

Tyr Arg Phe Lys 
115 

Val lie Lys Val 
130 

Lys Arg Lys Asn 
145 

Lys Gly Thr Lys 



Leu Ser Lys Glu 

180 

Arg Tyr Thr Thr 
195 

Asp Asp Leu Gly 
210 

Tyr lie Pro Lys 
225 

Ala Tyr Trp Ser 



Arg Pro Thr Pro 

260 

Asp Val Thr Pro 
275 

Tyr His Pro Ala 
290 

Gin Arg Asp Glu 
305 

Leu His Arg Leu 



lie Phe Glu Pro 

340 



lie lie Ser Glu 



Gin Ser Asp Val 

120 

Asn Gly Asn Tyr 
135 

lie Arg Thr Lys 
150 

Glu Ala Lys Glu 
165 

Glu Val Ala Ala 



Asp Asp Gly Tyr 

200 

Asp Ala Tyr Leu 
215 

Lys Asp Leu Ser 
230 

Gin Lys Gin Gly 
245 

Ala Pro Ala Pro 



Asn Pro Gly Gin 

280 

Pro Pro Arg Pro 
295 

Phe Lys Gly Lys 
310 

Asp Leu Lys Tyr 
325 

Thr Gin Val lie 



Glu Leu Leu Met 
105 

lie Asn Glu lie 



Tyr Val Tyr Leu 

140 

Gin Gin lie Ala 
155 

Lys Gly Leu Ala 
170 

Val Asn Glu Ala 
185 

lie Phe Ser Pro 



Val Pro His Gly 

220 

Pro Ser Glu Leu 
235 

Arg Gly Ala Arg 
250 

Gly Arg Arg Lys 
265 

Gly His Gin Pro 



Asn Asp Ala Ser 

300 

Thr Phe Lys Glu 
315 

Arg His Val Glu 
330 

Lys Ser Asn Ala 
345 



Thr Asp Pro Asn 
110 

Leu Asp Gly Tyr 
125 

Lys Pro Gly Ser 



Glu Gin Val Ala 

160 

Gin Val Ala His 
175 

Lys Arg Gin Gly 
190 

Thr Asp lie lie 
205 

Asn His Tyr His 



Ala Ala Ala Gin 

240 

Pro Ser Asp Tyr 

2l 5 5 

Ala Pro lie Pro 
270 

Asp Asn Gly Gly 
285 

Gin Asn Lys His 



Leu Leu Asp Gin 

320 

Glu Asp Gly Leu 
335 

Phe Gly Tyr Val 
350 



1 



Val Pro His 
355 

Pro Leu Glu 
370 

Asp Asp Asp 
385 

Thr His Thr 



Asp Gly Lys 



Ser lie His 
435 

0 His Phe His 

^ Glu Val Ala 

1 465 

TZ. Ala Ala Leu 



Lys Lys Val 



Met Pro Lys 
515 

Leu Thr Gin 
530 

Lys His Tyr 
545 

Ala Val Asp 



Asp Thr Gly 



Val Pro Tyr 
595 



Gly Asp His Tyr 



Met Glu Leu Ala 

375 

Ser Gly Ser Asp 
390 

Phe Leu Gly His 
405 

Pro Tyr Asp Thr 
420 

Ser Val Asp Lys 



Tyr lie Gly Phe 

455 

Asn Trp Val Lys 
470 

Asp Gin Glu Gin 
485 

Ser Arg Lys Val 
500 

Asp Gly Lys Asp 



lie Ala Phe Ala 

535 

Arg Tyr Asp lie 
550 

Val Ser Ser Leu 
565 

Ser Ser Phe Val 
580 

Ser Trp Leu Thr 



His lie lie Pro Arg 
360 

Asp Arg Tyr Leu Ala 

380 

His Ser Lys Pro Ser 

395 

Arg lie Lys Ala Tyr 
410 

Ser Asp Ala Tyr Val 
425 

Ser Gly Val Thr Ala 
440 

Gly Glu Leu Glu Gin 

460 

Ala Lys Gly Gin Ala 

475 

Gly Lys Glu Lys Pro 
490 

Thr Lys Asp Gly Lys 
505 

Tyr Phe Tyr Ala Arg 
520 

Glu Gin Glu Leu Met 

540 

Val Asp Thr Gly lie 

555 

Pro Met His Ala Gly 
570 

lie Pro His lie Asp 
585 

Arg Asp Gin lie Ala 
600 



Ser Gin Leu Ser 
365 

Gly Gin Thr Glu 



Asp Lys Glu Val 

400 

Gly Lys Gly Leu 
415 

Phe Ser Lys Glu 
430 

Lys His Gly Asp 
445 

Tyr Glu Leu Asp 



Asp Glu Leu Ala 

480 

Leu Phe Asp Thr 
495 

Val Gly Tyr Met 
510 

Asp Gin Leu Asp 
525 

Leu Lys Asp Lys 



Glu Pro Arg Leu 

560 

Asn Ala Thr Tyr 
575 

His lie His Val 
590 

Thr lie Lys Tyr 
605 



4 



4 



Val Met Gin His Pro Glu Val Arg Pro Asp lie Trp Ser Lys Pro Gly 
610 615 620 

His Glu Glu Ser Gly Ser Val lie Pro Asn Val Thr Pro Leu Asp Lys 

625 630 635 640 

Arg Ala Gly Met Pro Asn Trp Gin lie lie His Ser Ala Glu Glu Val 

645 650 655 

Gin Lys Ala Leu Ala Glu Gly Arg Phe Ala Thr Pro Asp Gly Tyr lie 

660 665 670 

Phe Asp Pro Arg Asp Val Leu Ala Lys Glu Thr Phe Val Trp Lys Asp 
675 680 685 

Gly Ser Phe Ser lie Pro Arg Ala Asp Gly Ser Ser Leu Arg Thr lie 
690 695 700 

Asn Lys Ser Asp Leu Ser Gin Ala Glu Trp Gin Gin Ala Gin Glu Leu 
705 710 715 720 

Leu Ala Lys Lys Asn Ala Gly Asp Ala Thr Asp Thr Asp Lys Pro Lys 

725 730 735 

Glu Lys Gin Gin Ala Asp Lys Ser Asn Glu Asn Gin Gin Pro Ser Glu 

740 745 750 

Ala Ser Lys Glu Glu Glu Lys Glu Ser Asp Asp Phe lie Asp Ser Leu 
755 760 765 

Pro Asp Tyr Gly Leu Asp Arg Ala Thr Leu Glu Asp His lie Asn Gin 
770 775 780 

Leu Ala Gin Lys Ala Asn lie Asp Pro Lys Tyr Leu lie Phe Gin Pro 
785 790 795 800 

Glu Gly Val Gin Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr Tyr Asp 

805 810 815 

lie Lys Thr Leu Gin Gin lie Asn Pro 

820 825 



<210> 3 
<211> 2379 
<212> DNA 

<213> Streptococcus pyogenes 



5 



<400> 3 

atgaaaacga aaaaagttat tattttagtt ggtctattgt tatcatctca gttgactttg 60 
atagcttgtc aatcacgagg taatggtaca tatcccatta aaacgaaaca atcacgtaag 12 0 
ggaatgacgt caaacaaaat taaaccgatt aaaaaaagca aaaagacaaa caagactcac 180 
aaaggtgtgg cgggtgtcga ttttcctaca gatgatgggt ttattttaac caaagactca 24 0 
aaaatcttat caaaaacaga tcagggaatc gttgttgacc atgatggtca ttcgcatttt 3 00 
attttttatg ccgatttaaa gggaagtcca tttgaatacc ttattccaaa aggagcaagt 360 
ttagctaagc cagctgttgc tcagcgagca gctagtcaag ggacttctaa agtagcagat 420 
cctcatcacc attatgaatt taacccagcg gatattgtgg ctgaagatgc tttaggctac 480 
acggttcgcc acgatgatca cttccattat attttgaagt caagcttatc aggtcagaca 540 
caggcacaag ctaaacaggt tgctactcgc ttgccacaaa ccagtagcct tgtttcaaca 600 
gctacagcta atggtattcc aggcttgcat ttcccaacct cagatggttt tcaatttaac 660 
ggtcaaggta ttgttggggt aacaaaagac agtattttag tggaccacga tggtcactta 720 
catcctattt cttttgcgga ccttcgtcag ggtggctggg cacatgtggc agatcaatac 780 
gatcccgcta aaaaagcaga aaagccagca gaaacccatc agacaccaga gctatctgaa 840 
cgtgaaaagg aataccaaga aaaattagct tatttggcag aaaaattggg gattgatcca 900 
tcaactatta aacgtgtgga aacacaagac ggtaaacttg gtttggaata ccctcaccat 960 
gaccacgcac acgtattgat gttatctgat attgaaatcg gaaaagacat tccagatcca 1020 
catgctattg agcatgcccg tgaattggaa aaacataagg ttggaatgga taccttgcgt 1080 
gccttagggt ttgatgaaga agtgattttg gatatcgttc gcactcacga tgctccaacc 1140 
ccattcccat caaatgaaaa agatccgaat atgatgaaag aatggttagc aacggttatc 12 00 
aaacttgact tgggcagccg taaagatcct ttgcaacgta aaggactttc actgttaccc 1260 
aacttagaaa ctttaggaat tggctttaca ccaatcaaag atatctcacc tgttttgcaa 1320 
tttaaaaaat tgaaacagtt gttaatgaca aaaacagggg tgactgatta tagatttttg 13 80 
gataatatgc cacagttaga aggcattgat atttcacaaa acaatctcaa agatattagt 144 0 
ttcttgagca aatataaaaa cttaactcta gtagcggctg ctgataatgg tattgaagat 1500 
attaggccgc ttggtcaatt accaaatctc aaattcctcg tattgagtaa caataagatt 1560 
tctgatttaa gcccactggc atcgttacat caattgcaag aattgcacat tgataataat 1620 
cagattacag atttaagccc tgtttctcat aaagaatcat tgacggttgt tgatttatca 1680 
agaaatgctg atgttgactt agcaacactt caagcaccca aattagaaac gttaatggtc 1740 
aatgatacca aggtttctca tttggatttc ttgaaaaata atcctaatct atctagccta 1800 
tctattaacc gtgcgcaatt gcaatctctt gaaggtattg aagcaagtag cgtcattgtc 1860 
agagtagaag cagaaggtaa ccaaattaaa tcgcttgtgc ttaaagacaa gcaagggtca 1920 
cttactttct tggatgtgac aggcaaccag ttgacttctc tagaaggtgt taataatttt 1980 
acagcacttg acattttaag cgtgtctaaa aaccaattaa caaatgtcaa cctatctaaa 2 040 
cccaataaga cagttactaa cattgatatt agtcataaca atatctcatt agcagacctt 2100 
aaattgaacg agcaacatat tccagaagcc attgcgaaaa acttcccagc ggtttacgaa 2160 
ggttctatgg taggtaatgg aacagctgaa gaaaaagcag ctatggctac taaggcgaaa 2220 
gaaagtgctc aagaagcatc ggaatcacat gactacaacc ataatcatac ctatgaagat 2280 
gaagaaggtc atgctcacga gcacagagac aaagatgatc acgaccatga acatgaggat 2340 
gaaaatgaag ctaaagatga gcaaaaccat gctgactaa 23 79 



<210> 4 

<211> 792 
<212> PRT 

<213> Streptococcus pyogenes 



<400> 4 
Met Lys 
1 

Gin Leu 



lie Lys 



Pro lie 
50 

Gly Val 
65 

Lys lie 



His Ser 



Leu 



7 Arg Ala 
13 0 

fy Tyr Glu 

ifi 145 

fl Thr Val 



Ser Gly 



Gin Thr 



Leu His 
210 

Val Gly 
225 

His Pro 



Thr Lys Lys Val 

5 

Thr Leu lie Ala 
20 

Thr Lys Gin Ser 
35 

Lys Lys Ser Lys 



Asp Phe Pro Thr 

70 

Leu Ser Lys Thr 
85 

His Phe lie Phe 
100 

He Pro Lys Gly 
115 

Ala Ser Gin Gly 



Phe Asn Pro Ala 

150 

Arg His Asp Asp 
165 

Gin Thr Gin Ala 
180 

Ser Ser Leu Val 
195 

Phe Pro Thr Ser 



Val Thr Lys Asp 

230 

He Ser Phe Ala 
245 



He He Leu 



Cys Gin Ser 
25 

Arg Lys Gly 
40 

Lys Thr Asn 
55 

Asp Asp Gly 



Asp Gin Gly 



Tyr Ala Asp 
105 

Ala Ser Leu 
120 

Thr Ser Lys 
135 

Asp He Val 



His Phe His 



Gin Ala Lys 
185 

Ser Thr Ala 
200 

Asp Gly Phe 
215 

Ser He Leu 



Asp Leu Arg 



Val Gly Leu Leu 
10 

Arg Gly Asn Gly 



Met Thr Ser Asn 

45 

Lys Thr His Lys 
60 

Phe He Leu Thr 
75 

He Val Val Asp 
90 

Leu Lys Gly Ser 



Ala Lys Pro Ala 

125 

Val Ala Asp Pro 
140 

Ala Glu Asp Ala 
155 

Tyr He Leu Lys 
170 

Gin Val Ala Thr 



Thr Ala Asn Gly 

205 

Gin Phe Asn Gly 
220 

Val Asp His Asp 
235 

Gin Gly Gly Trp 
250 



Leu Ser Ser 
15 

Thr Tyr Pro 
30 

Lys He Lys 



Gly Val Ala 



Lys Asp Ser 
80 

His Asp Gly 
95 

Pro Phe Glu 
110 

Val Ala Gin 



His His His 



Leu Gly Tyr 
160 

Ser Ser Leu 
175 

Arg Leu Pro 
190 

He Pro Gly 



Gin Gly He 



Gly His Leu 
240 

Ala His Val 
255 
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Ala Asp Gin 



His Gin Thr 
275 

Leu Ala Tyr 
290 

Arg Val Glu 
305 

Asp His Ala 



lie Pro Asp 

^ Lys Val Gly 

if 355 

m He Leu Asp 

Asn Glu Lys 

D 385 

Lys Leu Asp 

f*^ " 

Ser Leu Leu 



Lys Asp He 
435 

Met Thr Lys 
450 

Gin Leu Glu 
465 

Phe Leu Ser 



Gly He Glu 



Tyr Asp Pro Ala 
260 

Pro Glu Leu Ser 



Leu Ala Glu Lys 

295 

Thr Gin Asp Gly 
310 

His Val Leu Met 
325 

Pro His Ala He 
340 

Met Asp Thr Leu 



He Val Arg Thr 

375 

Asp Pro Asn Met 
390 

Leu Gly Ser Arg 
405 

Pro Asn Leu Glu 
420 

Ser Pro Val Leu 



Thr Gly Val Thr 

455 

Gly He Asp He 
470 

Lys Tyr Lys Asn 
485 

Asp He Arg Pro 
500 



Lys Lys Ala Glu Lys 
265 

Glu Arg Glu Lys Glu 
280 

Leu Gly He Asp Pro 

300 

Lys Leu Gly Leu Glu 

315 

Leu Ser Asp He Glu 
330 

Glu His Ala Arg Glu 
345 

Arg Ala Leu Gly Phe 
360 

His Asp Ala Pro Thr 

380 

Met Lys Glu Trp Leu 

395 

Lys Asp Pro Leu Gin 
410 

Thr Leu Gly He Gly 
425 

Gin Phe Lys Lys Leu 

440 

Asp Tyr Arg Phe Leu 

460 

Ser Gin Asn Asn Leu 

475 

Leu Thr Leu Val Ala 
490 

Leu Gly Gin Leu Pro 
505 



Pro Ala Glu Thr 
270 

Tyr Gin Glu Lys 
285 

Ser Thr He Lys 



Tyr Pro His His 

320 

He Gly Lys Asp 
335 

Leu Glu Lys His 
350 

Asp Glu Glu Val 
365 

Pro Phe Pro Ser 



Ala Thr Val He 

400 

Arg Lys Gly Leu 
415 

Phe Thr Pro He 
430 

Lys Gin Leu Leu 
445 

Asp Asn Met Pro 



Lys Asp He Ser 

480 

Ala Ala Asp Asn 
495 

Asn Leu Lys Phe 
510 
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Leu Val Leu 
515 

Leu His Gin 
530 

Leu Ser Pro 
545 

Arg Asn Ala 



Thr Leu Met 



Asn Asn Pro 
595 

i 

S Ser Leu Glu 
% 610 

i ^ 'i 

tQ Glu Gly Asn 

bii 625 

iii Leu Thr Phe 

Val Asn Asn 



Leu Thr Asn 
675 

Asp lie Ser 
690 

Gin His He 
705 

Gly Ser Met 



Thr Lys Ala 



Asn His Asn 
755 



Ser Asn Asn Lys 



Leu Gin Glu Leu 

535 

Val Ser His Lys 
550 

Asp Val Asp Leu 
565 

Val Asn Asp Thr 
580 

Asn Leu Ser Ser 



Gly He Glu Ala 

615 

Gin lie Lys Ser 
630 

Leu Asp Val Thr 
645 

Phe Thr Ala Leu 
660 

Val Asn Leu Ser 



His Asn Asn He 

695 

Pro Glu Ala He 
710 

Val Gly Asn Gly 
725 

Lys Glu Ser Ala 
740 

His Thr Tyr Glu 



He Ser Asp Leu Ser 
520 

His He Asp Asn Asn 

540 

Glu Ser Leu Thr Val 

555 

Ala Thr Leu Gin Ala 
570 

Lys Val Ser His Leu 
585 

Leu Ser He Asn Arg 
600 

Ser Ser Val He Val 

620 

Leu Val Leu Lys Asp 

635 

Gly Asn Gin Leu Thr 
650 

Asp He Leu Ser Val 
665 

Lys Pro Asn Lys Thr 
680 

Ser Leu Ala Asp Leu 

700 

Ala Lys Asn Phe Pro 

715 

Thr Ala Glu Glu Lys 
730 

Gin Glu Ala Ser Glu 
745 

Asp Glu Glu Gly His 
760 



Pro Leu Ala Ser 
525 

Gin He Thr Asp 



Val Asp Leu Ser 

560 

Pro Lys Leu Glu 
575 

Asp Phe Leu Lys 
590 

Ala Gin Leu Gin 
605 

Arg Val Glu Ala 



Lys Gin Gly Ser 

640 

Ser Leu Glu Gly 
655 

Ser Lys Asn Gin 
670 

Val Thr Asn He 

685 

Lys Leu Asn Glu 



Ala Val Tyr Glu 

720 

Ala Ala Met Ala 
735 

Ser His Asp Tyr 
750 

Ala His Glu His 
765 
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Arg Asp Lys Asp Asp His Asp His Glu His Glu Asp Glu Asn Glu Ala 
770 775 780 

Lys Asp Glu Gin Asn His Ala Asp 
785 790 



<210> 5 

<211> 2469 

<212> DNA 

<213> Streptococcus agalactiae 

<400> 5 

gtgaagaaaa catatggtta tatcggctca gttgctgcta ttttactagc tactcatatt 60 
ggaagttacc agcttggtaa gcatcatatg ggtctagcaa caaaggacaa tcagattgcc 120 
tatattgatg atagcaaagg taaggtaaaa gcccctaaaa caaacaaaac gatggatcaa 180 
atcagtgctg aagaaggcat ctctgctgaa cagatcgtag tcaaaattac tgaccaaggt 240 
tatgttacct cacacggtga ccattatcat ttttacaatg ggaaagttcc ttatgatgcg 300 
attattagtg aagagttgtt gatgacggat cctaattacc attttaaaca atcagacgtt 360 
atcaatgaaa tcttagacgg ttacgttatt aaagtcaatg gcaactatta tgtttacctc 42 0 
aagccaggta gtaagcgcaa aaacattcga accaaacaac aaattgctga gcaagtagcc 48 0 
aaaggaacta aagaagctaa agaaaaaggt ttagctcaag tggcccatct cagtaaagaa 54 0 
gaagttgcgg cagtcaatga agcaaaaaga caaggacgct atactacaga cgatggctat 600 
atttttagtc cgacagatat cattgatgat ttaggagatg cttatttagt acctcatggt 660 
aatcactatc attatattcc taaaaaagat ttgtctccaa gtgagctagc tgctgcacaa 720 
gcctactgga gtcaaaaaca aggtcgaggt gctagaccgt ctgattaccg cccgacacca 780 
gccccaggtc gtaggaaagc cccaattcct gatgtgacgc ctaaccctgg acaaggtcat 84 0 
cagccagata acggtggtta tcatccagcg cctcctaggc caaatgatgc gtcacaaaac 900 
aaacaccaaa gagatgagtt taaaggaaaa acctttaagg aacttttaga tcaactacac 960 
cgtcttgatt tgaaataccg tcatgtggaa gaagatgggt tgatttttga accgactcaa 1020 
gtgatcaaat caaacgcttt tgggtatgtg gtgcctcatg gagatcatta tcatattatc 1080 
ccaagaagtc agttatcacc acttgaaatg gaattagcag atcgatactt agccggccaa 114 0 
actgatgaca acgactcagg ttcagatcac tcaaaaccat cagataaaga agtgacacat 1200 
acctttcttg gtcatcgcat caaagcttac ggaaaaggct tagatggtaa accatatgat 126 0 
acgagtgatg cttatgtttt tagtaaagaa tccattcatt cagtggataa atcaggagtt 1320 
acagctaaac acggagatca tttccactat ataggatttg gagaacttga acaatatgag 13 80 
ttggatgagg tcgctaactg ggtgaaagca aaaggtcaag ctgatgagct tgttgctgct 1440 
ttggatcagg aacaaggcaa agaaaaacca ctctttgaca ctaaaaaagt gagtcgcaaa 1500 
gtaacaaaag atggtaaagt gggctatatt atgccaaaag atggcaagga ctatttctat 1560 
gctcgttatc aacttgattt gactcagatt gcctttgccg aacaagaact aatgcttaaa 1620 
gataagaagc attaccgtta tgacattgtt gatacaggca ttgagccacg acttgctgta 1680 
gatgtgtcaa gtctgccgat gcatgctggt aatgctactt acgatactgg aagttcgttt 1740 
gttatcccac atattgatca tatccatgtc gttccgtatt catggttgac gcgcaatcag 1800 
attgcaacaa tcaagtatgt gatgcaacac cccgaagttc gtccggatgt atggtctaag 1860 
ccagggcatg aagagtcagg ttcggtcatt ccaaatgtta cgcctcttga taaacgtgct 1920 
ggtatgccaa actggcaaat tatccattct gctgaagaag ttcaaaaagc cctagcagaa 1980 
ggtcgttttg cagcaccaga cggctatatt ttcgatccac gagatgtttt ggcaaaagaa 2040 
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acttttgtat ggaaagatgg ctcctttagc atcccaagag cagatggcag ttcattgaga 2100 
accattaata aatccgatct atcccaagct gagtggcaac aagctcaaga gttattggca 2160 
aagaaaaatg ctggtgatgc tactgatacg gataaacctg aagaaaagca acaggcagat 2220 
aagagcaatg aaaaccaaca gccaagtgaa gccagtaaag aagaaaaaga atcagatgac 228 0 
tttatagaca gtttaccaga ctatggtcta gatagagcaa ccctagaaga tcatatcaat 2340 
caattagcac aaaaagctaa tatcgatcct aagtatctca ttttccaacc agaaggtgtc 2400 
caattttata ataaaaatgg tgaattggta acttatgata tcaagacact tcaacaaata 2460 
aacccttaa 2469 



<210> 6 
<211> 822 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 6 

Val Lys Lys Thr Tyr Gly Tyr lie Gly Ser Val Ala Ala lie Leu Leu 
15 10 15 

Ala Thr His lie Gly Ser Tyr Gin Leu Gly Lys His His Met Gly Leu 

20 25 30 

Ala Thr Lys Asp Asn Gin lie Ala Tyr lie Asp Asp Ser Lys Gly Lys 
35 40 45 

Val Lys Ala Pro Lys Thr Asn Lys Thr Met Asp Gin lie Ser Ala Glu 
50 55 60 

Glu Gly lie Ser Ala Glu Gin lie Val Val Lys lie Thr Asp Gin Gly 
65 70 75 80 

Tyr Val Thr Ser His Gly Asp His Tyr His Phe Tyr Asn Gly Lys Val 

85 90 95 

Pro Tyr Asp Ala lie lie Ser Glu Glu Leu Leu Met Thr Asp Pro Asn 

100 105 110 

Tyr His Phe Lys Gin Ser Asp Val lie Asn Glu lie Leu Asp Gly Tyr 
115 120 125 

Val lie Lys Val Asn Gly Asn Tyr Tyr Val Tyr Leu Lys Pro Gly Ser 
130 135 140 

Lys Arg Lys Asn lie Arg Thr Lys Gin Gin He Ala Glu Gin Val Ala 
145 150 155 160 

Lys Gly Thr Lys Glu Ala Lys Glu Lys Gly Leu Ala Gin Val Ala His 

165 170 175 
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Leu Ser Lys Glu 

180 

Arg Tyr Thr Thr 
195 

Asp Asp Leu Gly 
210 

Tyr lie Pro Lys 
225 

Ala Tyr Trp Ser 

Arg Pro Thr Pro 

260 

Thr Pro Asn Pro 
275 

Pro Ala Pro Pro 
290 

Asp Glu Phe Lys 
305 

Arg Leu Asp Leu 



Glu Pro Thr Gin 

340 

His Gly Asp His 
355 

Glu Met Glu Leu 
370 

Asp Ser Gly Ser 
385 

Thr Phe Leu Gly 



Lys Pro Tyr Asp 

420 



Glu Val Ala Ala 



Asp Asp Gly Tyr 

200 

Asp Ala Tyr Leu 
215 

Lys Asp Leu Ser 
230 

Gin Lys Gin Gly 
245 

Ala Pro Gly Arg 



Gly Gin Gly His 

280 

Arg Pro Asn Asp 
295 

Gly Lys Thr Phe 
310 

Lys Tyr Arg His 
325 

Val lie Lys Ser 



Tyr His lie lie 

360 

Ala Asp Arg Tyr 
375 

Asp His Ser Lys 
390 

His Arg lie Lys 
405 

Thr Ser Asp Ala 



Val Asn Glu Ala 
185 

lie Phe Ser Pro 



Val Pro His Gly 

220 

Pro Ser Glu Leu 

235 

Arg Gly Ala Arg 
250 

Arg Lys Ala Pro 
265 

Gin Pro Asp Asn 



Ala Ser Gin Asn 

300 

Lys Glu Leu Leu 
315 

Val Glu Glu Asp 
330 

Asn Ala Phe Gly 
345 

Pro Arg Ser Gin 



Leu Ala Gly Gin 

380 

Pro Ser Asp Lys 
395 

Ala Tyr Gly Lys 
410 

Tyr Val Phe Ser 
425 



Lys Arg Gin Gly 
190 

Thr Asp lie lie 
205 

Asn His Tyr His 



Ala Ala Ala Gin 

240 

Pro Ser Asp Tyr 
255 

lie Pro Asp Val 
270 

Gly Gly Tyr His 
285 

Lys His Gin Arg 



Asp Gin Leu His 

320 

Gly Leu lie Phe 
335 

Tyr Val Val Pro 
350 

Leu Ser Pro Leu 
365 

Thr Asp Asp Asn 



Glu Val Thr His 

400 

Gly Leu Asp Gly 
415 

Lys Glu Ser lie 
430 



His Ser Val Asp 
435 

His Tyr lie Gly 
450 

Ala Asn Trp Val 
465 

Leu Asp Gin Glu 



Val Ser Arg Lys 

500 

Lys Asp Gly Lys 
515 

Gin lie Ala Phe 
53 0 

Tyr Arg Tyr Asp 
545 

Asp Val Ser Ser 



Gly Ser Ser Phe 

580 

Tyr Ser Trp Leu 
595 

Gin His Pro Glu 
610 

Glu Ser Gly Ser 
625 

Gly Met Pro Asn 



Ala Leu Ala Glu 

660 

Pro Arg Asp Val 
675 



Lys Ser Gly Val 

440 

Phe Gly Glu Leu 
455 

Lys Ala Lys Gly 
470 

Gin Gly Lys Glu 
485 

Val Thr Lys Asp 



Asp Tyr Phe Tyr 

520 

Ala Glu Gin Glu 
535 

lie Val Asp Thr 
550 

Leu Pro Met His 
565 

Val lie Pro His 



Thr Arg Asn Gin 

600 

Val Arg Pro Asp 
615 

Val lie Pro Asn 
63 0 

Trp Gin lie lie 
645 

Gly Arg Phe Ala 



Leu Ala Lys Glu 

680 



Thr Ala Lys His 



Glu Gin Tyr Glu 

460 

Gin Ala Asp Glu 
475 

Lys Pro Leu Phe 
490 

Gly Lys Val Gly 
505 

Ala Arg Tyr Gin 



Leu Met Leu Lys 

540 

Gly lie Glu Pro 
555 

Ala Gly Asn Ala 
570 

lie Asp His lie 
585 

lie Ala Thr lie 



Val Trp Ser Lys 

620 

Val Thr Pro Leu 
635 

His Ser Ala Glu 
650 

Ala Pro Asp Gly 
665 

Thr Phe Val Trp 



Gly Asp His Phe 
445 

Leu Asp Glu Val 



Leu Val Ala Ala 

480 

Asp Thr Lys Lys 
495 

Tyr lie Met Pro 
510 

Leu Asp Leu Thr 
525 

Asp Lys Lys His 



Arg Leu Ala Val 

560 

Thr Tyr Asp Thr 

575 

His Val Val Pro 
590 

Lys Tyr Val Met 
605 

Pro Gly His Glu 



Asp Lys Arg Ala 

640 

Glu Val Gin Lys 
655 

Tyr lie Phe Asp 
670 

Lys Asp Gly Ser 
685 
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Phe Ser lie Pro 
690 

Ser Asp Leu Ser 
705 

Lys Lys Asn Ala 



Gin Gin Ala Asp 

740 

Lys Glu Glu Lys 
755 

Gly Leu Asp Arg 
770 

Lys Ala Asn lie 
785 

Gin Phe Tyr Asn 



Arg Ala Asp Gly 
695 

Gin Ala Glu Trp 
710 

Gly Asp Ala Thr 

725 

Lys Ser Asn Glu 



Glu Ser Asp Asp 

760 

Ala Thr Leu Glu 
775 

Asp Pro Lys Tyr 
790 

Lys Asn Gly Glu 
805 



Ser Ser Leu Arg 

700 

Gin Gin Ala Gin 
715 

Asp Thr Asp Lys 
730 

Asn Gin Gin Pro 
745 

Phe lie Asp Ser 



Asp His He Asn 

780 

Leu He Phe Gin 
795 

Leu Val Thr Tyr 
810 



Thr He Asn Lys 



Glu Leu Leu Ala 

720 

Pro Glu Glu Lys 
735 

Ser Glu Ala Ser 
750 

Leu Pro Asp Tyr 
765 

Gin Leu Ala Gin 



Pro Glu Gly Val 

800 

Asp He Lys Thr 
815 



Leu Gin Gin He Asn Pro 

820 
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